Description

COPYRIGHT NOTIFICATION 

Portions of this patent application contain materials that are subject to copyright protection. The copyright owner 
has no objection to the facsimile reproduction by anyone of the patent document, or the patent disclosure, as it appears 
in the Patent and Trademark Office. 

Field of the Invention 

This invention relates to a system, method and apparatus for improving response time of a network based transaction
session. More particularly, the invention relates to a system, method and apparatus to alleviate network load by use
of a location-independent-indexed cache to eliminate repetitive and/or redundant transfers of data objects already
resident in the cache.

Background of the Invention 

In a number of interactive network applications, it is common for a user to request access to data that is housed on 
a remote site, but which data has previously been transferred to the user during an earlier session, or an earlier point 
of time in the present session. The process of requesting the transfer of such information over the network consumes 
significant time and network resources, particularly if the resource requested is of a substantial size. Therefore, it is 
desirable to provide a means to recognize that a particular resource stored at a remote location is already available on 
the user's local computer system, and to obtain the desired copy from that local version, rather than performing a
network call to obtain a new identical copy. In the past, programs such as World Wide Web browsers (such as Sun
Microsystems, Inc.'s HotJava, Netscape Communications Corp.'s Netscape Navigator and Microsoft Corp.'s Internet
Explorer) frequently have operated using a cache indexed by a Uniform Resource Locator (URL). By means of this 
cache, the WWW browser can recognize when a particular URL has been previously referenced, and may present data 
that had been previously obtained in response to the previous request for the same URL. 

However, one shortcoming of this approach is that a URL-indexed cache is capable of providing the user with the 
previously recovered copy of the requested resource only when the second request for the resource is requesting that 
resource from the exact location as that from which the resource was initially obtained. That is, if the user requests that 
a resource be obtained from a new location, a WWW browser equipped with a URL-indexed cache will obtain the 
requested resource from the specified location, even if the same data is already resident on the user's computer system 
from a previous transfer from a different location. This approach results in an undesirable amount of redundant data 
transfers. 

This problem is particularly aggravated when the resource requested is not requested under the user's direct con- 
trol. An example of such a case is where a user requests a particular resource, and that resource in turn requires addi- 
tional resources. The user has no opportunity to indicate to the system that previously obtained resources should be 
used instead of redundant copies of the secondary resources. For example, a user may request a particular web page 
to be displayed, and that page may in turn request transmission of a graphic image to be displayed, or a program (often 
referred to as an "applet") to be executed in conjunction with the displayed page. The user has no opportunity to indicate 
to the browser that a previously obtained image file or applet file should be used in lieu of the file located at the remote 
location. 

It is desirable, therefore, to support a remotely located resource that resides on a plurality of computer systems that 
are identified by a location-independent identifier, thereby allowing reuse of a previously obtained copy of that resource, 
even if the previously obtained copy was obtained from a different location. 

SUMMARY OF THE INVENTION 

A system, method, and apparatus for obtaining a copy of a data object is disclosed. A location-independent
identifier associated with the desired data object is obtained, for example, from a primary file that requires a copy of the data
object. A cache is interrogated to determine whether a copy of the data object is cached. If the data object is cached, a
copy of the cached data object is obtained from the cache. If the data object is not cached, a network call is performed
to obtain a new copy of the data object.
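
The lookup summarized above can be sketched as follows. This is a minimal illustration only, not the claimed implementation: the names ObjectCache, obtain_object and fetch, the sample identifier "1.2.3" and the host names are assumptions introduced for the example.

```python
from typing import Callable, Dict, Optional


class ObjectCache:
    """Hypothetical cache keyed by a location-independent identifier (OID)."""

    def __init__(self) -> None:
        self._by_oid: Dict[str, bytes] = {}

    def get(self, oid: str) -> Optional[bytes]:
        # Returns None when no copy of the object is cached.
        return self._by_oid.get(oid)

    def put(self, oid: str, data: bytes) -> None:
        self._by_oid[oid] = data


def obtain_object(oid: str, url: str, cache: ObjectCache,
                  fetch: Callable[[str], bytes]) -> bytes:
    """Return the cached copy if present; otherwise perform a network call."""
    cached = cache.get(oid)
    if cached is not None:
        return cached            # no transfer needed, whatever the URL says
    data = fetch(url)            # assumed network call, supplied by the caller
    cache.put(oid, data)
    return data


if __name__ == "__main__":
    calls = []

    def fake_fetch(url: str) -> bytes:
        # Stand-in for a real network transfer; records each call.
        calls.append(url)
        return b"applet-bytes"

    cache = ObjectCache()
    obtain_object("1.2.3", "http://sysa.com/FOO.CLASS", cache, fake_fetch)
    obtain_object("1.2.3", "http://other.example/FOO.CLASS", cache, fake_fetch)
    print(calls)  # only the first location was contacted
```

Because the cache is keyed by the identifier rather than by location, the second request is satisfied locally even though it names a different server.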

Additional features of the invention will become apparent upon examination of the description that follows, particu- 
larly with reference to the accompanying drawings. 
DESCRIPTION OF THE DRAWINGS

The foregoing and other objects, aspects and advantages are better understood from the following detailed 
description of a preferred embodiment of the invention with reference to the drawings, in which: 

Figure 1 is a block diagram of a representative hardware environment in accordance with a preferred embodiment;

Figure 2 depicts a client computer operating a network client program in communication with a server computer;

Figure 3 depicts an example of a downloaded HTML document;

Figure 4 depicts an operation of obtaining an applet specified by an applet tag;

Figure 5 depicts the server response to an HTTP GET request;

Figure 5A depicts the format of a Java applet class file;

Figure 5B depicts the format of a GIF Application Extension data area;

Figure 6 depicts transmission of a second HTML document from a second HTTP server to the client computer over
a communications link;

Figure 7 depicts an example of a second HTML document specifying retrieval of an applet;

Figure 8 depicts the process by which the client computer obtains a copy of the desired object from a cache;

Figure 9 is a flow chart depicting the overall operation of the invention; and

Figure 10 depicts an embodiment of the present invention in which a single cache is shared by a plurality of clients.

DETAILED DESCRIPTION 

A preferred embodiment of a system in accordance with the present invention is preferably practiced in the context
of a personal computer such as the IBM PS/2, Apple Macintosh computer or UNIX based workstation. A representative
hardware environment is depicted in Figure 1, which illustrates a typical hardware configuration of a workstation in
accordance with a preferred embodiment having a central processing unit 10, such as a microprocessor, and a number
of other units interconnected via a system bus 12. The workstation shown in Figure 1 includes a Random Access Memory
(RAM) 14, Read Only Memory (ROM) 16, an I/O adapter 18 for connecting peripheral devices such as disk storage
units 20 to the bus 12, a user interface adapter 22 for connecting a keyboard 24, a mouse 26, a speaker 28, a microphone
32, and/or other user interface devices such as a touch screen (not shown) to the bus 12, a communication
adapter 34 for connecting the workstation to a communication network (e.g., a data processing network) and a display
adapter 36 for connecting the bus 12 to a display device 38. The workstation typically has resident thereon an operating
system such as the Microsoft Windows Operating System (OS), the IBM OS/2 operating system, the MAC OS, or UNIX
operating system. Those skilled in the art will appreciate that the present invention may also be implemented on platforms
and operating systems other than those mentioned.

A preferred embodiment is written using Java, C, and the C++ language and utilizes object oriented programming
methodology. Object oriented programming (OOP) has become increasingly used to develop complex applications. As
OOP moves toward the mainstream of software design and development, various software solutions will need to be
adapted to make use of the benefits of OOP. A need exists for these principles of OOP to be applied to a messaging
interface of an electronic messaging system such that a set of OOP classes and objects for the messaging interface
can be provided.

OOP is a process of developing computer software using objects, including the steps of analyzing the problem,
designing the system, and constructing the program. An object is a software package that contains both data and a
collection of related structures and procedures. Since it contains both data and a collection of structures and procedures,
it can be visualized as a self-sufficient component that does not require other additional structures, procedures or data
to perform its specific task. OOP, therefore, views a computer program as a collection of largely autonomous components,
called objects, each of which is responsible for a specific task. This concept of packaging data, structures, and
procedures together in one component or module is called encapsulation.
In general, OOP components are reusable software modules which present an interface that conforms to an object
model and which are accessed at run-time through a component integration architecture. A component integration
architecture is a set of architecture mechanisms which allow software modules in different process spaces to utilize
each other's capabilities or functions. This is generally done by assuming a common component object model on which
to build the architecture.

It is worthwhile to differentiate between an object and a class of objects at this point. An object is a single instance 
of the class of objects, which is often just called a class. A class of objects can be viewed as a blueprint, from which 
many objects can be formed.

OOP allows the programmer to create an object that is a part of another object. For example, the object represent- 
ing a piston engine is said to have a composition-relationship with the object representing a piston. In reality, a piston 
engine comprises a piston, valves and many other components; the fact that a piston is an element of a piston engine 
can be logically and semantically represented in OOP by two objects. 

OOP also allows creation of an object that "depends from" another object. If there are two objects, one representing
a piston engine and the other representing a piston engine wherein the piston is made of ceramic, then the relationship
between the two objects is not that of composition. A ceramic piston engine does not make up a piston engine. Rather,
it is merely one kind of piston engine that has one more limitation than the piston engine; its piston is made of ceramic.
In this case, the object representing the ceramic piston engine is called a derived object, and it inherits all of the aspects
of the object representing the piston engine and adds further limitation or detail to it. The object representing the
ceramic piston engine "depends from" the object representing the piston engine. The relationship between these
objects is called inheritance.

When the object or class representing the ceramic piston engine inherits all of the aspects of the objects representing
the piston engine, it inherits the thermal characteristics of a standard piston defined in the piston engine class. However,
the ceramic piston engine object overrides these with ceramic-specific thermal characteristics, which are typically
different from those associated with a metal piston. It skips over the original and uses new functions related to ceramic
pistons. Different kinds of piston engines will have different characteristics, but may have the same underlying functions
associated with them (e.g., how many pistons in the engine, ignition sequences, lubrication, etc.). To access each of these
functions in any piston engine object, a programmer would call the same functions with the same names, but each type
of piston engine may have different/overriding implementations of functions behind the same name. This ability to hide
different implementations of a function behind the same name is called polymorphism and it greatly simplifies communication
among objects.
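
The piston engine example can be made concrete with a short sketch. The class and method names below (Piston, PistonEngine, CeramicPistonEngine, thermal_characteristics, describe) and the default of four pistons are illustrative assumptions, not part of the described embodiment.

```python
class Piston:
    """A part that an engine is composed of."""

    def __init__(self, material: str) -> None:
        self.material = material


class PistonEngine:
    def __init__(self, piston_count: int = 4) -> None:
        # Composition: the engine object contains piston objects.
        self.pistons = [self._make_piston() for _ in range(piston_count)]

    def _make_piston(self) -> Piston:
        return Piston("metal")

    def piston_count(self) -> int:
        return len(self.pistons)

    def thermal_characteristics(self) -> str:
        return "metal piston thermal profile"


class CeramicPistonEngine(PistonEngine):
    """Inheritance: one kind of piston engine with one more limitation."""

    def _make_piston(self) -> Piston:
        return Piston("ceramic")

    def thermal_characteristics(self) -> str:
        # Overrides the inherited behavior behind the same name.
        return "ceramic piston thermal profile"


def describe(engine: PistonEngine) -> str:
    # Polymorphism: the caller uses the same names for any engine type.
    return f"{engine.piston_count()} pistons, {engine.thermal_characteristics()}"


if __name__ == "__main__":
    print(describe(PistonEngine()))
    print(describe(CeramicPistonEngine(6)))
```

The describe function never checks which kind of engine it was given; each object supplies its own implementation behind the shared function name.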

With the concepts of composition-relationship, encapsulation, inheritance and polymorphism, an object can repre- 
sent just about anything in the real world. In fact, our logical perception of the reality is the only limit on determining the 
kinds of things that can become objects in object-oriented software. Some typical categories are as follows: 

- Objects can represent physical objects, such as automobiles in a traffic-flow simulation, electrical components in a 
circuit-design program, countries in an economics model, or aircraft in an air-traffic-control system. 

- Objects can represent elements of the computer-user environment such as windows, menus or graphics objects. 

- An object can represent an inventory, such as a personnel file or a table of the latitudes and longitudes of cities. 

- An object can represent user-defined data types such as time, angles, and complex numbers, or points on the 
plane. 

With this enormous capability of an object to represent just about any logically separable matters, OOP allows the 
software developer to design and implement a computer program that is a model of some aspects of reality, whether 
that reality is a physical entity, a process, a system, or a composition of matter. Since the object can represent anything, 
the software developer can create an object which can be used as a component in a larger software project in the
future.

If 90% of a new OOP software program consists of proven, existing components made from preexisting reusable 
objects, then only the remaining 10% of the new software project has to be written and tested from scratch. Since 90% 
already came from an inventory of extensively tested reusable objects, the potential domain from which an error could 
originate is 10% of the program. As a result, OOP enables software developers to build objects out of other, previously 
built, objects. 

This process closely resembles complex machinery being built out of assemblies and sub-assemblies. OOP technology,
therefore, makes software engineering more like hardware engineering in that software is built from existing
components, which are available to the developer as objects. All this adds up to an improved quality of the software as 
well as an increased speed of its development. 

Programming languages are beginning to fully support the OOP principles, such as encapsulation, inheritance,
polymorphism, and composition-relationship. With the advent of the C++ language, many commercial software developers
have embraced OOP. C++ is an OOP language that offers a fast, machine-executable code. Furthermore, C++ is
suitable for both commercial-application and systems-programming projects. For now, C++ appears to be the most popular
choice among many OOP programmers, but there is a host of other OOP languages, such as Smalltalk, Common Lisp
Object System (CLOS), and Eiffel. Additionally, OOP capabilities are being added to more traditional popular computer
programming languages such as Pascal.

The benefits of object classes can be summarized as follows:

- Objects and their corresponding classes break down complex programming problems into many smaller, simpler
problems.

- Encapsulation enforces data abstraction through the organization of data into small, independent objects that can
communicate with each other. Encapsulation protects the data in an object from accidental damage, but allows
other objects to interact with that data by calling the object's member functions and structures.

- Subclassing and inheritance make it possible to extend and modify objects through deriving new kinds of objects
from the standard classes available in the system. Thus, new capabilities are created without having to start from
scratch.

- Polymorphism and multiple inheritance make it possible for different programmers to mix and match characteristics
of many different classes and create specialized objects that can still work with related objects in predictable ways.

- Class hierarchies and containment hierarchies provide a flexible mechanism for modeling real-world objects and
the relationships among them.

Libraries of reusable classes are useful in many situations, but they also have some limitations. For example:

- Complexity. In a complex system, the class hierarchies for related classes can become extremely confusing, with
many dozens or even hundreds of classes.

- Flow of control. A program written with the aid of class libraries is still responsible for the flow of control (i.e., it must
control the interactions among all the objects created from a particular library). The programmer has to decide
which functions to call at what times for which kinds of objects.

- Duplication of effort. Although class libraries allow programmers to use and reuse many small pieces of code, each
programmer puts those pieces together in a different way. Two different programmers can use the same set of class
libraries to write two programs that do exactly the same thing but whose internal structure (i.e., design) may be
quite different, depending on hundreds of small decisions each programmer makes along the way. Inevitably, similar
pieces of code end up doing similar things in slightly different ways and do not work as well together as they
should.

Class libraries are very flexible. As programs grow more complex, more programmers are forced to reinvent basic
solutions to basic problems over and over again. A relatively new extension of the class library concept is to have a
framework of class libraries. This framework is more complex and consists of significant collections of collaborating
classes that capture both the small scale patterns and major mechanisms that implement the common requirements
and design in a specific application domain. Frameworks were first developed to free application programmers from the
chores involved in displaying menus, windows, dialog boxes, and other standard user interface elements for personal
computers.

Frameworks also represent a change in the way programmers think about the interaction between the code they
write and code written by others. In the early days of procedural programming, the programmer called libraries provided
by the operating system to perform certain tasks, but basically the program executed down the page from start to finish,
and the programmer was solely responsible for the flow of control. This was appropriate for printing out paychecks,
calculating a mathematical table, or solving other problems with a program that executed in just one way.

The development of graphical user interfaces began to turn this procedural programming arrangement inside out.
These interfaces allow the user, rather than program logic, to drive the program and decide when certain actions should
be performed. Today, most personal computer software accomplishes this by means of an event loop which monitors
the mouse, keyboard, and other sources of external events and calls the appropriate parts of the programmer's code
according to actions that the user performs. The programmer no longer determines the order in which events occur.
Instead, a program is divided into separate pieces that are called at unpredictable times and in an unpredictable order.
By relinquishing control in this way to users, the developer creates a program that is much easier to use. Nevertheless,
individual pieces of the program written by the developer still call libraries provided by the operating system to
accomplish certain tasks, and the programmer must still determine the flow of control within each piece after it's called by the
event loop. Application code still "sits on top of" the system.
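
A minimal sketch of such an event loop is given here for illustration. The EventLoop class, its on, post and run methods, and the event names are hypothetical; a real windowing system would obtain events from the mouse and keyboard rather than from a queue filled in advance.

```python
from collections import deque
from typing import Callable, Deque, Dict, List, Tuple

Event = Tuple[str, str]  # (kind of event, payload)


class EventLoop:
    """Hypothetical event loop: dispatches queued events to registered code."""

    def __init__(self) -> None:
        self._handlers: Dict[str, Callable[[str], None]] = {}
        self._queue: Deque[Event] = deque()

    def on(self, kind: str, handler: Callable[[str], None]) -> None:
        self._handlers[kind] = handler

    def post(self, kind: str, payload: str) -> None:
        self._queue.append((kind, payload))

    def run(self) -> None:
        # The order of calls is set by the order of events, not by the program.
        while self._queue:
            kind, payload = self._queue.popleft()
            handler = self._handlers.get(kind)
            if handler is not None:
                handler(payload)


if __name__ == "__main__":
    log: List[str] = []
    loop = EventLoop()
    loop.on("key", lambda p: log.append(f"typed {p}"))
    loop.on("click", lambda p: log.append(f"clicked {p}"))
    loop.post("click", "OK button")
    loop.post("key", "a")
    loop.post("scroll", "down")  # no handler registered, so it is skipped
    loop.run()
    print(log)
```

Each handler is a separate piece of the program that runs only when the loop calls it, while the handler itself still decides what happens once it has been called.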

Even event loop programs require programmers to write a lot of code that should not need to be written separately
for every application. The concept of an application framework carries the event loop concept further. Instead of dealing
with all the nuts and bolts of constructing basic menus, windows, and dialog boxes and then making these things all
work together, programmers using application frameworks start with working application code and basic user interface
elements in place. Subsequently, they build from there by replacing some of the generic capabilities of the framework
with the specific capabilities of the intended application.

Application frameworks reduce the total amount of code that a programmer has to write from scratch. However, 
because the framework is really a generic application that displays windows, supports copy and paste, and so on, the 
programmer can also relinquish control to a greater degree than event loop programs permit. The framework code 
takes care of almost all event handling and flow of control, and the programmer's code is called only when the frame- 
work needs it (e.g., to create or manipulate a proprietary data structure). 

A programmer writing a framework program not only relinquishes control to the user (as is also true for event loop 
programs), but also relinquishes the detailed flow of control within the program to the framework. This approach allows 
the creation of more complex systems that work together in interesting ways, as opposed to isolated programs, having 
custom code, being created over and over again for similar problems. 

Thus, as is explained above, a framework basically is a collection of cooperating classes that make up a reusable
design solution for a given problem domain. It typically includes objects that provide default behavior (e.g., for menus
and windows), and programmers use it by inheriting some of that default behavior and overriding other behavior so that
the framework calls application code at the appropriate times.

There are three main differences between frameworks and class libraries:

- Behavior versus protocol. Class libraries are essentially collections of behaviors that you can call when you want
those individual behaviors in your program. A framework, on the other hand, provides not only behavior but also the
protocol or set of rules that govern the ways in which behaviors can be combined, including rules for what a programmer
is supposed to provide versus what the framework provides.

- Call versus override. With a class library, the code the programmer writes instantiates objects and calls their member
functions. It's possible to instantiate and call objects in the same way with a framework (i.e., to treat the framework
as a class library), but to take full advantage of a framework's reusable design, a programmer typically writes
code that overrides and is called by the framework. The framework manages the flow of control among its objects.
Writing a program involves dividing responsibilities among the various pieces of software that are called by the
framework rather than specifying how the different pieces should work together.

- Implementation versus design. With class libraries, programmers reuse only implementations, whereas with frameworks,
they reuse design. A framework embodies the way a family of related programs or pieces of software work.
It represents a generic design solution that can be adapted to a variety of specific problems in a given domain. For
example, a single framework can embody the way a user interface works, even though two different user interfaces
created with the same framework might solve quite different interface problems.
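
The "call versus override" distinction can be illustrated with a small sketch. The Application base class stands in for a framework and StockTicker for application code; both names, and the hook methods on_start, handle_event and on_quit, are assumptions made for the example.

```python
from typing import List


class Application:
    """Hypothetical framework class: owns the flow of control."""

    def run(self, events: List[str]) -> List[str]:
        # The framework decides when each piece of application code is called.
        output = [self.on_start()]
        for event in events:
            output.append(self.handle_event(event))
        output.append(self.on_quit())
        return output

    # Default behavior that application code may inherit or override.
    def on_start(self) -> str:
        return "window opened"

    def handle_event(self, event: str) -> str:
        return f"ignored {event}"

    def on_quit(self) -> str:
        return "window closed"


class StockTicker(Application):
    """Application code: overrides one hook and inherits the rest."""

    def handle_event(self, event: str) -> str:
        return f"ticker updated: {event}"


if __name__ == "__main__":
    print(StockTicker().run(["ACME 10"]))
```

The application never calls run's internals in sequence itself; it supplies handle_event and the framework calls it at the appropriate time, which is the inversion the text describes.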

Thus, through the development of frameworks for solutions to various problems and programming tasks, significant 
reductions in the design and development effort for software can be achieved. A preferred embodiment of the invention 
utilizes Hypertext Markup Language (HTML) to implement documents on the Internet together with a general-purpose 
secure communication protocol for a transport medium between the client and the merchant. HTML is a simple data
format used to create hypertext documents that are portable from one platform to another. HTML documents are SGML 
documents with generic semantics that are appropriate for representing information from a wide range of domains. 
HTML has been in use by the World-Wide Web global information initiative since 1990. HTML is an application of ISO
Standard 8879:1986, Information Processing - Text and Office Systems - Standard Generalized Markup Language
(SGML).

To date, Web development tools have been limited in their ability to create dynamic Web applications which span 
from client to server and interoperate with existing computing resources. Until recently, HTML has been the dominant 
technology used in development of Web-based solutions. However, HTML has proven to be inadequate in the following 
areas: 

o Poor performance; 

o Restricted user interface capabilities; 

o Can only produce static Web pages; 

o Lack of interoperability with existing applications and data; and 
o Inability to scale. 

Sun Microsystems' Java language solves many of the client-side problems by:

o Improving performance on the client side; 

o Enabling the creation of dynamic, real-time Web applications; and 

o Providing the ability to create a wide variety of user interface components.
With Java, developers can create robust User Interface (UI) components. Custom "widgets" (e.g. real-time stock
tickers, animated icons, etc.) can be created, and client-side performance is improved. Unlike HTML, Java supports the
notion of client-side validation, offloading appropriate processing onto the client for improved performance. Dynamic,
real-time Web pages can be created. Using the above-mentioned custom UI components, dynamic Web pages can
also be created.

Sun's Java language has emerged as an industry-recognized language for "programming the Internet." Sun 
defines Java as: "a simple, object-oriented, distributed, interpreted, robust, secure, architecture-neutral, portable, high- 
performance, multithreaded, dynamic, buzzword-compliant, general-purpose programming language. Java supports 
programming for the Internet in the form of platform-independent Java applets." Java applets are small, specialized
applications that comply with Sun's Java Application Programming Interface (API) allowing developers to add "interactive
content" to Web documents (e.g. simple animations, page adornments, basic games, etc.). Applets execute within
a Java-compatible browser (e.g. Netscape Navigator) by copying code from the server to client. From a language
standpoint, Java's core feature set is based on C++. Sun's Java literature states that Java is basically "C++, with extensions
from Objective C for more dynamic method resolution".

Another technology that provides similar function to Java is provided by Microsoft and ActiveX Technologies, to give
developers and Web designers the wherewithal to build dynamic content for the Internet and personal computers. ActiveX
includes tools for developing animation, 3-D virtual reality, video and other multimedia content. The tools use Internet
standards, work on multiple platforms, and are being supported by over 100 companies. The group's building blocks are
called ActiveX Controls, small, fast components that enable developers to embed parts of software in hypertext markup
language (HTML) pages. ActiveX Controls work with a variety of programming languages including Microsoft Visual
C++, Borland Delphi, Microsoft Visual Basic programming system and, in the future, Microsoft's development tool for
Java, code named "Jakarta." ActiveX Technologies also includes ActiveX Server Framework, allowing developers to
create server applications. One of ordinary skill in the art will readily recognize that ActiveX could be substituted for Java
without undue experimentation to practice the invention.

Figure 2 depicts a client computer 210 operating a network client program, such as a web browser, in communication
with server computer 215. Server 215 may be, for example, a hypertext transport protocol ("HTTP") server. Server
215 is named "SYSA." This denotes that the server is addressed with a name such as SYSA, or is located externally to
the client's network, for example, in a system with the name SYSA.COM. Server 215 is in communication with client
210 using a communications link 220 operating, for example, using the HTTP protocol and the Transmission Control
Protocol and Internet Protocol ("TCP" and "IP," respectively, or collectively "TCP/IP"). HTTP is described in R. Fielding,
et al., Hypertext Transfer Protocol: HTTP/1.1 (June 3, 1996) (draft), the disclosure of which is hereby incorporated by
reference. TCP is described in Information Sciences Institute, RFC 793: Transmission Control Protocol DARPA Internet
Program Protocol Specification (September 1981), the disclosure of which is hereby incorporated by reference. IP is
described in Information Sciences Institute, RFC 791: Internet Protocol DARPA Internet Program Protocol Specification
(September 1981), the disclosure of which is hereby incorporated by reference.

As depicted in Figure 2, server 215 is transferring a copy of document 225, named PAGE1.HTML, to client 210, in
response to a previous HTTP GET request (not shown) by client 210. Client 210 is equipped with a cache file 230
indexed by a cache table 235. Cache table 235 is a table used to index cache 230. Cache table 235 comprises a plurality
of rows 238 and columns 240. Each row 238 is used to describe a cached element, such as downloaded HTML
document 225. Columns 240 include a URI column 245, an OID column 247, and a cache pointer column 249. The use
of the cache table will be described in further detail below; in summary, URI column 245 contains a representation
of the location of a cached resource. Typically, this will be a representation of a Uniform Resource Identifier
("URI"), such as a Uniform Resource Locator ("URL"), or any other indicator of the location of the cached resource. The
syntax and semantics of URLs are described in Berners-Lee, et al., RFC 1738: Uniform Resource Locators (URL)
(December 1994), the disclosure of which is hereby incorporated by reference.

OID column 247 contains an object identifier, if any, for the cached resource, or is null if no object identifier is
associated with the cached resource. Cache pointer column 249 contains a pointer to the cache element in the cache. This
may be, for example, the name of a file located within a specific predetermined cache directory or cache device, or it
may be a specification of a location within a larger file of a particular portion of the file corresponding to the cached
resource. In addition, the cache table will typically contain additional fields used to manage and optimize a cache table,
such as indicators of whether a particular cache row 238 is in use, its frequency of reference, etc. Such fields are not a
portion of the present invention and are not herein described.

As depicted in Figure 2, client 210 transfers a copy of downloaded file 225 to cache 230 as shown by arrow 260.
As depicted in Figure 2, a copy of document 225 is cached at location 265, and cache table row 238-1 is updated to
reflect the cache. In particular, for cache table row 238-1, URI column 245 is set to the value of the URI for the retrieved
document. Column 247 is set to null because, as will be described in further detail, no OID has been associated with
this resource. Column 249 has been set to point to cached element 265.
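
One possible in-memory representation of the cache table is sketched here for illustration. The CacheRow and CacheTable names, the field names, the sample cache file name and the sample URL are assumptions; the management fields mentioned above (in-use flags, reference counts) are omitted.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class CacheRow:
    uri: str                 # URI column 245: where the resource came from
    oid: Optional[str]       # OID column 247: None when no OID is associated
    cache_pointer: str       # cache pointer column 249: e.g. a file name


class CacheTable:
    """Hypothetical sketch of cache table 235 as a list of rows."""

    def __init__(self) -> None:
        self.rows: List[CacheRow] = []

    def add(self, uri: str, oid: Optional[str], cache_pointer: str) -> CacheRow:
        row = CacheRow(uri, oid, cache_pointer)
        self.rows.append(row)
        return row

    def find_by_uri(self, uri: str) -> Optional[CacheRow]:
        return next((r for r in self.rows if r.uri == uri), None)

    def find_by_oid(self, oid: Optional[str]) -> Optional[CacheRow]:
        if oid is None:
            return None      # a null OID never matches another null OID
        return next((r for r in self.rows if r.oid == oid), None)


if __name__ == "__main__":
    table = CacheTable()
    # Row for the downloaded HTML document: it has a URI but no OID.
    table.add("http://sysa.com/PAGE1.HTML", None, "cache/0001")
    print(table.find_by_uri("http://sysa.com/PAGE1.HTML"))
    print(table.find_by_oid(None))
```

Keeping both the URI and the OID in each row lets the same table serve conventional location-based lookups and location-independent ones.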

Figure 3 depicts an example of downloaded document 225. In the depicted example, the file is in Hypertext Markup
Language ("HTML"). The HTML format is described in Berners-Lee, et al., RFC 1866: Hypertext Markup Language -
2.0 (November 1995), which is hereby incorporated by reference. The HTML document 225 comprises a plurality of
HTML tags 310 and text 320 to be rendered and displayed. With the exception of the applet tag 330, all of HTML tags
310 are known and described in Berners-Lee, and are not further described herein. Applet tag 330 is an HTML tag that
is used to specify an applet that should be loaded and executed when HTML document 225 is rendered and displayed.
Specifically, applet tag 330 is an extension to HTML devised by Sun Microsystems, Inc. of Mountain View, California.
Typically, applets are written in a platform-independent object-oriented language such as Sun Microsystems' Java
language. A Java applet is generally transported in Sun's CLASS-format file. Applet tag 330 contains four parameters:
CODE parameter 340, WIDTH parameter 342, HEIGHT parameter 344, and OID parameter 350. The CODE, WIDTH,
and HEIGHT parameters are known in the art. The CODE parameter 340 is used to specify the name of a program file
to be downloaded from the same server location as HTML document 225. In the example depicted, CODE parameter
340 indicates that the program named FOO.CLASS is to be obtained from the SYSA server 215 because the web page
was part of the HTML information. WIDTH parameter 342 and HEIGHT parameter 344 indicate the size of screen area
where the output of the applet will be displayed. In the example shown, an area 300 pixels wide and 100 pixels high is
specified.

The OID parameter 350 is not previously known. OID parameter 350 is used to specify an object identifier. The object identifier is an identifier associated with a particular object, and the OID associated with a particular object is guaranteed to be unique. A preferred method of specifying the OID is using Abstract Syntax Notation One ("ASN.1"), as described in ISO 8824, and the ASN.1 Basic Encoding Rules ("BER") as described in ISO 8825. ASN.1 and BER provide for an "object identifier" datatype.

The ASN.1 object identifier is in the form of a series of integers separated by decimal points. Each integer represents a node on an ASN.1 object identifier tree. The ASN.1 object identifier tree is a structure with a root node, arcs beneath that node to other nodes, with arcs beneath them and so on. As specified in the ASN.1 standard, each node is assigned to some responsible body that allocates arcs and nodes beneath it. The body ensures that all the arcs beneath its node are numbered sequentially starting from 0 or 1 and that each node beneath it is either assigned to some responsible body or is assigned to name a particular object. In the example shown, the OID has a value of 1.1.999999.72.6.3. The first integer, 1, has been allocated to the International Standards Organization ("ISO"). The ISO, therefore, has the responsibility for allocating every OID beginning with the integer 1. The second integer, 1, indicates that this object is within an ASN.1 hierarchy allocated to a "registration authority" ("RA"). A registration authority is an organization that is responsible for the allocation of any further ASN.1 OIDs within its name space. In the example shown, the fictitious registration authority code 999999 is used to depict that a particular registration authority, having been assigned the code 999999, is responsible for any object identifiers below this level. The remaining integers (72, 6, and 3) are arbitrary integers organized according to a functional scheme selected by the registration authority corresponding to RA code 999999, and guaranteed by the registration authority to be unique in the world. ASN.1, BER, the ASN.1 object identifier, and the ASN.1 object identifier tree are known constructs and are described in their respective standards and, for example, in Larmouth, Understanding OSI (International Thomson Computer Press, 1996), pp. 151-160, hereby incorporated by reference.
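
By way of illustration only, the dotted form of the object identifier described above might be handled as in the following Python sketch. The function names parse_oid and is_under are hypothetical and do not appear in this disclosure; the sketch merely splits the identifier into its integer arcs and tests whether it falls beneath the node assigned to a given registration authority.

```python
def parse_oid(text):
    """Split a dotted OID string into a tuple of non-negative integer arcs."""
    arcs = tuple(int(part) for part in text.split("."))
    if len(arcs) < 2 or any(arc < 0 for arc in arcs):
        raise ValueError("not a valid dotted OID: %r" % text)
    return arcs


def is_under(oid, authority_prefix):
    """Return True if `oid` lies in the subtree rooted at `authority_prefix`."""
    return oid[:len(authority_prefix)] == authority_prefix


# The example OID from Figure 3 and the fictitious registration authority 999999.
applet_oid = parse_oid("1.1.999999.72.6.3")
fictitious_ra = parse_oid("1.1.999999")
```

Here is_under(applet_oid, fictitious_ra) holds because the first three arcs of the example OID are 1, 1 and 999999.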

It will be seen from Figure 3, therefore, that the desired applet may be referred to either by a particular known location (file name FOO.CLASS on server SYSA) or by its ASN.1 object identifier (1.1.999999.72.6.3).

Figure 4 depicts an operation of obtaining the applet specified by applet tag 330 of Figure 3. In the example depicted, the applet specified by applet tag 330 has not been previously obtained, and therefore is unavailable from cache 230 of Figure 2. Therefore, as depicted in Figure 4, client 210 issues an HTTP GET request for the resource named FOO.CLASS on connection 220 to server SYSA 215.

Figure 5 depicts the server response to the GET request. Server 215 transmits a copy 510 of the FOO.CLASS object on communications line 220 to client computer 210. Client computer 210 places a copy of received file 510 in cache 230 as cache element 520. Client computer 210 then updates cache table 235 to reflect the cache update. Specifically, client computer 210 updates row 238-2 to reflect newly added element 520.

Client computer 210 updates cache table row 238-2 as follows. URI column 245 is set to the value of the URI from which the file was retrieved. OID column 247 is updated with the object identifier value for the object received. Finally, column 249 is updated with a pointer to the cache element 520 within cache 230.

Generally, an object that is received from a server may have an OID encoded into it, so that a client computer such as client computer 210 can discover the object's OID even if no OID had been specified prior to the transfer. The OID may be encoded, for example, in a predetermined position or field within the object, or may be stored on the server separately from the object and included by the server by encoding it into the HTTP GET response.

The Java CLASS format is highly structured and is particularly well-suited for the purpose of encoding an OID into an applet file. The Java CLASS format is described in Sun Microsystems' Java Virtual Machine Specification, Release 1.0 Beta DRAFT (August 21, 1995), the disclosure of which is hereby incorporated by reference.

Each class file contains the compiled version of either a Java class or a Java interface. An interpreter or "virtual machine" designed to execute a Java applet supports all class files that conform to this format.

A Java class file comprises a stream of 8-bit bytes. All 16-bit and 32-bit quantities are constructed by reading in two or four 8-bit bytes, respectively. The bytes are joined together in network (big-endian) order, where the high bytes come first. This format is supported by the Java java.io.DataInput and java.io.DataOutput interfaces, and classes such as java.io.DataInputStream and java.io.DataOutputStream.

The class file format is described here using a structure notation. Successive fields in the structure appear in the external representation without padding or alignment. Variable size arrays, often of variable sized elements, are called tables and are commonplace in these structures. The types u1, u2, and u4 mean an unsigned one-, two-, or four-byte quantity, respectively, which are read by methods such as readUnsignedByte, readUnsignedShort and readInt of the java.io.DataInput interface.

Figure 5A depicts the format of a class file 560, which is structured as follows: 



    ClassFile {
        u4 magic;
        u2 minor_version;
        u2 major_version;
        u2 constant_pool_count;
        cp_info constant_pool[constant_pool_count - 1];
        u2 access_flags;
        u2 this_class;
        u2 super_class;
        u2 interfaces_count;
        u2 interfaces[interfaces_count];
        u2 fields_count;
        field_info fields[fields_count];
        u2 methods_count;
        method_info methods[methods_count];
        u2 attributes_count;
        attribute_info attributes[attributes_count];
    }



magic

The "magic" field 561 is four bytes in length and is used to identify the file as a Java class-format file. The magic field has the value 0xCAFEBABE.

minor_version and major_version

The minor_version field 562 and major_version field 563 contain the version number of the Java compiler that produced this class file. The combination of the two fields may be interrogated by a virtual machine to determine whether it is capable of executing the applet. An implementation of the virtual machine will normally support some range of minor version numbers 0-n of a particular major version number. If the minor version number is incremented, the new code won't run on the old virtual machines, but it is possible to make a new virtual machine which can run versions up to version number n+1. A change of the major version number indicates a major incompatible change, one that requires a different virtual machine that may not support the old major version in any way.
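
As a minimal sketch of the header handling just described, the following Python fragment reads the magic, minor_version and major_version fields in big-endian order and applies a version check. The function names, the sample bytes and the compatibility policy in can_execute are assumptions made for illustration; the disclosure does not prescribe a particular implementation.

```python
import struct

MAGIC = 0xCAFEBABE


def read_header(data):
    """Return (minor_version, major_version) from the first 8 bytes of a class file."""
    if len(data) < 8:
        raise ValueError("class file header is truncated")
    # ">IHH" reads a big-endian u4 followed by two big-endian u2 values.
    magic, minor, major = struct.unpack(">IHH", data[:8])
    if magic != MAGIC:
        raise ValueError("bad magic: 0x%08X" % magic)
    return minor, major


def can_execute(minor, major, supported_major, max_minor):
    """Hypothetical policy: same major version, minor version within 0..max_minor."""
    return major == supported_major and 0 <= minor <= max_minor


# Illustrative header only: magic, minor version 3, major version 45.
sample = bytes.fromhex("CAFEBABE0003002D")
```

A virtual machine following this hypothetical policy would accept the sample header when it supports major version 45 with minor versions up to 3, and reject a header whose minor version is 4.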

constant_pool_count

The constant_pool_count field 564 indicates the number of entries in the constant pool 565 in the class file.

constant_pool

The constant pool 565 is a table of values. The values in the constant pool 565 comprise various string constants, class names, field names, and others that are referred to by the class structure or by the executable code in the applet. The first constant pool entry, denoted as constant_pool[0], is always unused by the compiler, and may be used by an implementation for any purpose.

Each of the constant_pool entries 1 through constant_pool_count-1 is a variable-length entry, whose format is indicated by the first "tag" byte, according to the following table:



    Value   Constant Type                  Meaning
    1       CONSTANT_Utf8                  utf-8 format string
    2       CONSTANT_Unicode               Unicode format string
    3       CONSTANT_Integer               integer
    4       CONSTANT_Float                 floating point
    5       CONSTANT_Long                  long integer
    6       CONSTANT_Double                double floating point
    7       CONSTANT_Class                 class
    8       CONSTANT_String                string
    9       CONSTANT_Fieldref              field reference
    10      CONSTANT_Methodref             method reference
    11      CONSTANT_InterfaceMethodref    interface method reference
    12      CONSTANT_NameAndType           name and type



A utf-8 format string constant pool entry represents a constant character string value. Utf-8 strings are encoded so that strings containing only non-null ASCII characters can be represented using only one byte per character, but characters of up to 16 bits can still be represented.

All characters in the range 0x0001 to 0x007F are represented by a single byte, in which bit 0 is set to binary '0' and in which bits 1-7 represent the ASCII code 0x0001 to 0x007F, respectively. The null character 0x0000 and characters in the range 0x0080 to 0x07FF are represented by a pair of two bytes, or 16 bits, denoted here as bits 0-15. Bits 0-2 are set to binary '110' and bits 8-9 are set to binary '10'. The remaining eleven bits 3-7 and 10-15 correspond respectively to the low-order eleven bits in the character to be encoded.

Characters in the range 0x0800 to 0xFFFF are represented by three bytes, or 24 bits, denoted here as bits 0-23. Bits 0-3, 8-9 and 16-17 are set to binary values '1110', '10', and '10', respectively. The remaining 16 bits 4-7, 10-15 and 18-23 correspond to the 16 bits in the character to be encoded.

The null character 0x00 is encoded in two-byte format rather than one-byte, with the result that encoded strings never have embedded nulls. Only one-byte, two-byte, and three-byte formats are used; longer utf-8 formats are unrecognized.

A utf-8 string is structured as follows:

    CONSTANT_Utf8_info {
        u1 tag;
        u2 length;
        u1 bytes[length];
    }

The tag field has the constant value 0x'0001' indicating a utf-8 encoded string. The length field is a two-byte field indicating the length of the string. The bytes field is the encoded string.
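
The one-, two- and three-byte encoding rules set out above may be sketched as follows. This Python fragment is offered only as an illustration under the stated rules; the names encode_char and encode_string are hypothetical. Note that bit 0 in the text above denotes the most significant bit of the first byte.

```python
def encode_char(code_point):
    """Encode one 16-bit character using the one-, two- or three-byte form."""
    if code_point < 0 or code_point > 0xFFFF:
        raise ValueError("only characters of up to 16 bits are supported")
    if 0x0001 <= code_point <= 0x007F:
        # One byte: leading bit 0, then the seven-bit ASCII code.
        return bytes([code_point])
    if code_point <= 0x07FF:
        # Two bytes: '110' + high five bits, '10' + low six bits.
        # The null character 0x0000 deliberately falls into this branch.
        return bytes([0xC0 | (code_point >> 6), 0x80 | (code_point & 0x3F)])
    # Three bytes: '1110' + high four bits, then two '10'-prefixed six-bit groups.
    return bytes([
        0xE0 | (code_point >> 12),
        0x80 | ((code_point >> 6) & 0x3F),
        0x80 | (code_point & 0x3F),
    ])


def encode_string(text):
    """Encode a string character by character; no zero byte appears in the result."""
    return b"".join(encode_char(ord(ch)) for ch in text)
```

Because the null character takes the two-byte form, a string such as "a\x00b" encodes to four bytes, none of which is zero.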

A UNICODE string constant pool entry represents a constant unencoded character string value. A UNICODE string 
is structured as follows: 



    CONSTANT_Unicode_info {
        u1 tag;
        u2 length;
        u1 bytes[length];
    }



The tag field has the constant value 0x'0002' indicating a unicode-format string. The length field is a two-byte field indicating the length of the string. The bytes field is the string value.

An integer constant pool entry represents a four-byte integer. The constant pool entry is structured as follows:

    CONSTANT_Integer_info {
        u1 tag;
        u4 bytes;
    }

The tag field has the constant value 0x'0003' indicating an integer. The bytes field is the integer value.

A float constant pool entry represents a four-byte floating-point number. The constant pool entry is structured as follows:

    CONSTANT_Float_info {
        u1 tag;
        u4 bytes;
    }



The tag field has the constant value 0x'0004' indicating a floating-point number. The bytes field is the floating-point value.

A long integer constant pool entry represents an eight-byte integer. The constant pool entry is structured as follows: 



    CONSTANT_Long_info {
        u1 tag;
        u4 high_bytes;
        u4 low_bytes;
    }



The tag field has the constant value 0x'0005' indicating a long integer. The high_bytes and low_bytes fields together make up the integer value. A long integer constant pool entry takes up two spots in the constant pool 565. If this is the nth entry in the constant pool 565, then the next entry will be numbered n+2.

A double float constant pool entry represents an eight-byte floating-point number. The constant pool entry is structured as follows:

    CONSTANT_Double_info {
        u1 tag;
        u4 high_bytes;
        u4 low_bytes;
    }



The tag field has the constant value 0x'0006' indicating a double floating-point number. The high_bytes and low_bytes fields together make up the floating-point value. A double float constant pool entry takes up two spots in the constant pool 565. If this is the nth entry in the constant pool 565, then the next entry will be numbered n+2.

A class constant pool entry represents a Java class or an interface. The constant pool entry is structured as follows:

    CONSTANT_Class_info {
        u1 tag;
        u2 name_index;
    }

The tag field has the constant value 0x'0007' indicating a class. The name_index field is a subscript into the constant pool 565, to a utf-8 format string constant that gives the string name of the class.

A string constant pool entry represents Java objects of the built-in Java type "String." The constant pool entry is structured as follows:

    CONSTANT_String_info {
        u1 tag;
        u2 string_index;
    }

The tag field has the constant value 0x'0008' indicating a string. The string_index field is a subscript into the constant pool 565, to a utf-8 format string constant that gives the value to which the String-type object is initialized.

A field reference constant pool entry, method reference constant pool entry and interface method reference constant pool entry represent references to Java fields, methods, and interface methods, respectively. The constant pool entries are structured as follows:

    CONSTANT_Fieldref_info {
        u1 tag;
        u2 class_index;
        u2 name_and_type_index;
    }

    CONSTANT_Methodref_info {
        u1 tag;
        u2 class_index;
        u2 name_and_type_index;
    }

    CONSTANT_InterfaceMethodref_info {
        u1 tag;
        u2 class_index;
        u2 name_and_type_index;
    }

The tag field has the constant value 0x'0009', 0x'000A', or 0x'000B', indicating a field reference, method reference, or interface method reference, respectively. The class_index field is a subscript into the constant pool 565, to a class constant that is used to identify the name of the class or interface containing the field or method. The name_and_type_index field is a subscript into the constant pool 565, to a NameAndType constant that is used to identify the name and signature of the field or method.

A NameAndType constant pool entry represents a field or method without indicating the class to which the name 
or field, as the case may be, belongs. The constant pool entry is structured as follows: 



    CONSTANT_NameAndType_info {
        u1 tag;
        u2 name_index;
        u2 signature_index;
    }



The tag field has the constant value 0x'000C' indicating a NameAndType entry. The name_index field is a subscript into the constant pool 565, to a utf-8 format string constant that gives the name of the field or method. The signature_index field is a subscript into the constant pool 565, to a utf-8 format string constant that gives a signature of the field or method. The signature, in this context, refers to a string that represents a type of a method, field or array. The field signature represents the value of an argument to a function or the value of a variable. A return-type signature represents the return value from a method. An argument signature represents an argument passed to a method. A method signature comprises one or more argument signatures and a return signature, thereby representing the arguments expected by a method, and the value that it returns.

The structure and self-referential nature of the constant pool thereby provides great flexibility in implementation of data encoded in an applet file.
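
One way the constant pool entries described above might be traversed is sketched below in Python. The entry sizes are taken from the structures given in this description; the function name walk_constant_pool and the sample pool bytes are assumptions for illustration only. The sketch honors the rule that long integer and double float entries each occupy two spots in the pool.

```python
import struct

# Payload size in bytes, after the one-byte tag, for each fixed-size entry type.
FIXED_SIZES = {3: 4, 4: 4, 5: 8, 6: 8, 7: 2, 8: 2, 9: 4, 10: 4, 11: 4, 12: 4}


def walk_constant_pool(data, offset, constant_pool_count):
    """Return ({index: (tag, payload)}, offset just past the constant pool)."""
    entries = {}
    index = 1  # constant_pool[0] is unused and is not stored in the file
    while index < constant_pool_count:
        tag = data[offset]
        offset += 1
        if tag in (1, 2):
            # utf-8 and Unicode strings carry a two-byte length before the bytes.
            (size,) = struct.unpack_from(">H", data, offset)
            offset += 2
        elif tag in FIXED_SIZES:
            size = FIXED_SIZES[tag]
        else:
            raise ValueError("unknown constant pool tag %d" % tag)
        entries[index] = (tag, data[offset:offset + size])
        offset += size
        # Long and double entries take up two spots in the pool.
        index += 2 if tag in (5, 6) else 1
    return entries, offset


# Illustrative pool: a utf-8 "Foo", a class naming entry 1, a long 1, an integer 7.
sample_pool = (
    bytes([1, 0, 3]) + b"Foo"
    + bytes([7, 0, 1])
    + bytes([5]) + (1).to_bytes(8, "big")
    + bytes([3]) + (7).to_bytes(4, "big")
)
sample_entries, sample_end = walk_constant_pool(sample_pool, 0, 6)
```

In the sample, the long integer is entry 3, so the integer that follows it is numbered 5 and no entry is numbered 4.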

access_flags

The access_flags field 566 contains a mask of up to sixteen modifiers used with class, method, and field declarations. The same encoding is used on similar fields in field_info and method_info as described below. The access_flags field is encoded as follows:



    Flag Name          Value    Meaning                                                                Used By
    ACC_PUBLIC         0x0001   Visible to everyone                                                    Class, Method, Variable
    ACC_PRIVATE        0x0002   Visible only to the defining class                                     Method, Variable
    ACC_PROTECTED      0x0004   Visible to subclasses                                                  Method, Variable
    ACC_STATIC         0x0008   Variable or method is static                                           Method, Variable
    ACC_FINAL          0x0010   No further subclassing, overriding, or assignment after initialization Class, Method, Variable
    ACC_SYNCHRONIZED   0x0020   Wrap use in monitor lock                                               Method
    ACC_VOLATILE       0x0040   Can't cache                                                            Variable
    ACC_TRANSIENT      0x0080   Not to be written or read by a persistent object manager               Variable
    ACC_NATIVE         0x0100   Implemented in a language other than Java                              Method
    ACC_INTERFACE      0x0200   Is an interface                                                        Class
    ACC_ABSTRACT       0x0400   No body provided                                                       Class, Method
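
For illustration, the mask encoding in the table above might be decoded as in the following Python sketch. The flag names and values are those of the table; the function name decode_access_flags is hypothetical.

```python
ACCESS_FLAGS = {
    0x0001: "ACC_PUBLIC",
    0x0002: "ACC_PRIVATE",
    0x0004: "ACC_PROTECTED",
    0x0008: "ACC_STATIC",
    0x0010: "ACC_FINAL",
    0x0020: "ACC_SYNCHRONIZED",
    0x0040: "ACC_VOLATILE",
    0x0080: "ACC_TRANSIENT",
    0x0100: "ACC_NATIVE",
    0x0200: "ACC_INTERFACE",
    0x0400: "ACC_ABSTRACT",
}


def decode_access_flags(mask):
    """Return the names of all flags set in a 16-bit access_flags mask, low bit first."""
    return [name for bit, name in sorted(ACCESS_FLAGS.items()) if mask & bit]
```

A mask of 0x0011, for example, decodes to ACC_PUBLIC and ACC_FINAL.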



this_class

The this_class field 567 is an index into the constant pool 565; constant_pool[this_class] must be of type CONSTANT_Class.

super_class

The super_class field 568 is an index into the constant pool 565. If the value of super_class field 568 is nonzero, then constant_pool[super_class] must be a class, and gives the index of this class's superclass (that is, the class from which the present class is derived) in the constant pool 565. If the value of super_class field 568 is zero, then the class being defined must be java.lang.Object, and it has no superclass.

interfaces_count

The interfaces_count field 569 gives the number of interfaces that this class implements.

interfaces table

Each value in interfaces table 570 is an index into the constant pool 565. If a table value is nonzero (interfaces[i] != 0, where 0 <= i < interfaces_count), then constant_pool[interfaces[i]] must be an interface that this class implements.

fields_count

The fields_count field 571 gives the number of instance variables, both static and dynamic, defined by this class. The fields table 572 includes only those variables that are defined explicitly by this class. It does not include those instance variables that are accessible from this class but are inherited from superclasses.

fields table

Each value in the fields table 572 is a more complete description of a field in the class. Each field is described by a variable length field_info structure. The format of this structure is as follows:



    field_info {
        u2 access_flags;
        u2 name_index;
        u2 signature_index;
        u2 attributes_count;
        attribute_info attributes[attributes_count];
    }



The access_flags field is a set of sixteen flags used by classes, methods, and fields to describe various properties and how they may be accessed by methods in other classes. This field has the same names, values and meanings as the access_flags field 566 previously disclosed.

The possible flags that can be set for a field are ACC_PUBLIC, ACC_PRIVATE, ACC_PROTECTED, ACC_STATIC, ACC_FINAL, ACC_VOLATILE, and ACC_TRANSIENT. At most one of ACC_PUBLIC, ACC_PROTECTED, and ACC_PRIVATE can be set for any field.

The name_index field is a subscript used to index into the constant pool 565 indicating a CONSTANT_Utf8 string, which is the name of the field.

The signature_index field is a subscript that is used to index into the constant pool 565 to indicate a CONSTANT_Utf8 string, which is the signature of the field.

The attributes_count field indicates the number of additional attributes about this field.

The attributes field represents the attributes of a particular field represented by the field_info structure. A field can have any number of optional attributes associated with it. For example, the "ConstantValue" attribute indicates that this field is a static numeric constant and gives the constant value of that field.

methods_count 

The methods_count field 573 indicates the number of methods, both static and dynamic, defined by this class. This table only includes those methods that are explicitly defined by this class. It does not include inherited methods.



methods table

Each value in the methods table 574 is a more complete description of a method in the class. Each method is described by a variable length method_info structure. The format of this structure is as follows:

    method_info {
        u2 access_flags;
        u2 name_index;
        u2 signature_index;
        u2 attributes_count;
        attribute_info attributes[attributes_count];
    }

The access_flags field is a set of sixteen flags used by classes, methods, and fields to describe various properties and how they may be accessed by methods in other classes. This field has the same names, values and meanings as the access_flags field 566 previously disclosed. The possible flags that can be set for a method are ACC_PUBLIC, ACC_PRIVATE, ACC_PROTECTED, ACC_STATIC, ACC_FINAL, ACC_SYNCHRONIZED, ACC_NATIVE, and ACC_ABSTRACT. At most one of ACC_PUBLIC, ACC_PROTECTED, and ACC_PRIVATE can be set for any method.

The name_index field is a subscript used to index into the constant pool 565 indicating a CONSTANT_Utf8 string, which is the name of the method.

The signature_index field is a subscript that is used to index into the constant pool 565 to indicate a CONSTANT_Utf8 string, which is the signature of the method.

The attributes_count field indicates the number of additional attributes about this method.

The attributes field represents the attributes of a particular method represented by the method_info structure. A method can have any number of optional attributes associated with it. Each attribute has a name, and other additional information. For example, the "Code" attribute describes the bytecodes that are executed to perform this method, and the "Exceptions" attribute describes the Java Exceptions that are declared to result from the execution of the method.

attributes_count

The attributes_count field 575 indicates the number of additional attributes about this class.

attributes

The attributes table 576 defines the attributes associated with the class. A class can have any number of optional attributes associated with it. For example, the "SourceFile" attribute indicates the name of the source file from which this class file was compiled.

Because of the highly structured nature of the Java CLASS file format, the file format is particularly well-suited for the purpose of encoding an OID into an applet file. In particular, the OID may be encoded as an attribute associated with the applet class. The structured nature of the CLASS format facilitates retrieval of the OID attribute. The OID may be most easily implemented as an attribute associated with the class and encoded into attribute table 576. It will be recalled that attribute table 576 includes all attributes associated with this class, such as the "SourceFile" attribute. An additional attribute, the OID attribute, may be implemented to provide the OID for the applet.

The OID attribute has the following format:

    OID_attribute {
        u2 attribute_name_index;
        u4 attribute_length;
        u2 oid_index;
    }

The attribute_name_index field is a two-byte field that provides an index into constant pool 565. The value in the attribute_name_index field is used to select a constant pool entry of type CONSTANT_Utf8 string encoding the character string "OID".

The attribute_length field is a four-byte field containing the value 0x0002, indicating that the following oid_index field is two bytes in length.

The oid_index field is a two-byte field that provides an index into constant pool 565. The value in the oid_index field is used to select a constant pool entry of type CONSTANT_Utf8 string. The string encoded in the selected CONSTANT_Utf8 string is the ASN.1 OID value associated with this applet.
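
The OID attribute layout described above may be sketched in Python as follows. This is an illustration under stated assumptions: the helper names are hypothetical, the constant pool is modeled as a simple dictionary of already-decoded strings, and the index values 9 and 10 are arbitrary.

```python
import struct


def utf8_entry(text):
    """Encode a CONSTANT_Utf8 constant pool entry (ASCII-only text assumed)."""
    raw = text.encode("ascii")
    return struct.pack(">BH", 1, len(raw)) + raw


def oid_attribute(name_index, oid_index):
    """Encode OID_attribute: u2 name index, u4 length (always 2), u2 OID index."""
    return struct.pack(">HIH", name_index, 2, oid_index)


def read_oid(attribute, pool):
    """Resolve an encoded OID_attribute against `pool`, a dict of index -> string."""
    name_index, length, oid_index = struct.unpack(">HIH", attribute)
    if pool[name_index] != "OID" or length != 2:
        raise ValueError("not an OID attribute")
    return pool[oid_index]


# Hypothetical pool slots: entry 9 names the attribute, entry 10 holds the OID.
example_pool = {9: "OID", 10: "1.1.999999.72.6.3"}
example_attribute = oid_attribute(9, 10)
```

The encoded attribute is eight bytes long, and resolving it against the example pool yields the OID string of Figure 3.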

Another file type that lends itself to including an embedded OID is the CompuServe Information Service's Graphics Interchange Format (GIF). The GIF file format is described in CompuServe, Inc., Graphics Interchange Format Version 89a (modified) (January 9, 1995), the disclosure of which is hereby incorporated by reference. The GIF specification is primarily geared to the encoding of compressed data that may be used to represent a graphic image. The GIF format specifies the optional inclusion of an Application Extension, a data area that may include application-specific data. Figure 5B depicts the format of an Application Extension data area 580.

One-byte Extension Introducer 581 has a value 0x21, which defines the data area as an extension. One-byte Application Extension Label 582 has a value 0xFF, which identifies the extension as an Application Extension. Blocksize 583 is a one-byte field that defines the size of the block, up to but not including Application Data 586, and has the value 0x0B (decimal 11). Application Identifier 584 is an eight-byte printable ASCII sequence used to identify the application associated with the Application Extension. A value of 'ASN.1OID' indicates that the Application Extension 580 is used to encode an ASN.1 OID. Application Authorization Code 585 is three bytes in length, and is optionally used by an application to validate the Application Identifier 584. It need not be employed in the present invention. Application data 586 is application-dependent data, and may be used to encode an OID. Block Terminator 587 is a one-byte field containing a value 0x00, and is used to indicate the end of the Application Extension 580.

Application data 586 may be used to encode an OID in the following form: 

    ASN.1_OID {
        int length;
        string oid;
    }



Where the length field is an integer that indicates the length of the OID, and the oid field is a character value of the 
OID associated with the GIF file. 
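
A minimal Python sketch of assembling and reading such an Application Extension follows. The width of the length field is not stated above; the sketch assumes a single byte, and also assumes an all-zero Application Authorization Code. The function names are hypothetical.

```python
EXTENSION_HEADER = bytes([0x21, 0xFF, 0x0B])  # introducer, label, blocksize (11)
APPLICATION_ID = b"ASN.1OID"                  # eight printable ASCII bytes


def build_oid_extension(oid, auth_code=b"\x00\x00\x00"):
    """Return an Application Extension carrying `oid` as its application data."""
    oid_bytes = oid.encode("ascii")
    if len(oid_bytes) > 255:
        raise ValueError("OID too long for the assumed one-byte length field")
    if len(auth_code) != 3:
        raise ValueError("authorization code must be three bytes")
    data = bytes([len(oid_bytes)]) + oid_bytes  # assumed: one-byte length, then OID
    return EXTENSION_HEADER + APPLICATION_ID + auth_code + data + b"\x00"


def parse_oid_extension(block):
    """Extract the OID string from a block produced by build_oid_extension."""
    if block[:3] != EXTENSION_HEADER or block[3:11] != APPLICATION_ID:
        raise ValueError("not an ASN.1OID Application Extension")
    length = block[14]  # first byte after the 3-byte header, 8-byte id, 3-byte code
    if block[15 + length:16 + length] != b"\x00":
        raise ValueError("missing block terminator")
    return block[15:15 + length].decode("ascii")


example_block = build_oid_extension("1.1.999999.72.6.3")
```

The blocksize of 11 covers the eight-byte Application Identifier and the three-byte Application Authorization Code, which is why the application data begins at offset 14.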

In the preferred embodiment, following the extraction of the OID from the received file, client computer 210 must perform some verification of received object 510, to validate that the received object 510 does, in fact, correspond to the OID specified and allocated by the software distributor that has responsibility for the OID. Typically, this may be performed by transmitting with the object 510 a digital signature associated with the file. Numerous methods of calculating digital signatures are known. According to one such methodology, a "message digest" is first calculated based upon the contents of the file to be digitally signed, in this case file 510. A message digest is the fixed-length result when a variable length message or file is provided to a one-way hashing function. A message digest helps verify that a message has not been altered, because a message digest calculated on an altered message would not equal the message digest on the unaltered message. After the message digest is calculated, the digest is encrypted using a private key associated with the software distributor. A recipient of the message (e.g., file 510) and the encrypted message digest for that message may validate the content of the message by first calculating its own version of the message digest, and then decrypting the received message digest using a public key maintained and widely distributed by the software distributor, and known to be valid and associated with that software distributor because it can be validated with a known registration authority. If the calculated message digest is equal to the decrypted received message digest, the message is known to be unaltered.
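
The digest-and-signature methodology described above can be illustrated with the following toy Python sketch. It is not a secure implementation and is not one of the methods named in this disclosure: SHA-256 stands in for the one-way hashing function, unpadded textbook RSA stands in for the public key method, the key is built from two fixed Mersenne primes, and the digest is reduced modulo the key modulus. All of these are assumptions chosen only to keep the example self-contained (Python 3.8 or later is assumed for the modular inverse).

```python
import hashlib

# Toy key material (assumption): two known Mersenne primes, far too small for real use.
P = 2**61 - 1
Q = 2**89 - 1
N = P * Q
E = 65537                           # public exponent
D = pow(E, -1, (P - 1) * (Q - 1))   # private exponent, held by the software distributor


def message_digest(data):
    """Fixed-length digest of a variable-length message, reduced to fit the toy key."""
    return int.from_bytes(hashlib.sha256(data).digest(), "big") % N


def sign(data):
    """Distributor side: encrypt the message digest with the private key."""
    return pow(message_digest(data), D, N)


def verify(data, signature):
    """Recipient side: decrypt the received digest and compare with a fresh digest."""
    return pow(signature, E, N) == message_digest(data)


original = b"contents of file 510"
signature = sign(original)
```

The recipient accepts the object when verify(original, signature) holds, and rejects an altered message because its freshly calculated digest no longer equals the decrypted one.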

Message validation may be performed using any of a number of widely available public key cryptography methods. Examples of such methods include Private Communications Technology ("PCT") from Microsoft Corp., Secure HyperText Transport Protocol ("SHTTP") from Terisa Systems, Shen, Kerberos, Photuris, Pretty Good Privacy ("PGP") and IPv6.

It should be noted that validation is a required step only in an untrusted environment, where the status of received objects is not known or is not trusted. In a trusted environment, in which the integrity of the object IDs and their associated resources are known to be valid, or where there is a high degree of confidence in the OID-resource coherency and the consequences of incoherence are minimal, the validation step may be ignored. For example, in a non-mission-critical environment in which all clients and all servers are under the control of the authority responsible for all of the OIDs, the validation of received objects might be ignored.

Figure 6 depicts transmission of a second HTML document 610 from a second HTTP server 615, named SYSB, to client 210 over communications link 620. Upon receipt of document 610, client 210 updates cache 230 as previously described. That is, client 210 adds a new row 238-3 in which URI column 245 is set to the URI for file 610, OID column 247 is set to null, and cache pointer column 249 is set to point to cache element 620.

Figure 7 depicts an example of HTML document 610 illustrative of the present invention. As with previous document
225, HTML document 610 comprises a number of HTML tags 710, including applet tag 730. Applet tag 730 comprises
a CODE parameter 740, a WIDTH parameter 742, a HEIGHT parameter 744, and an OID parameter 750. It will
be noted that the values for the CODE, WIDTH, and HEIGHT parameters in applet tag 730 are arbitrary and are not the
same as those previously shown for applet tag 330 in Figure 3. However, it will be noted that the OID parameter 750
has a value of 1.1.999999.72.6.3, which is identical to the OID of the previously obtained object, as can be seen by
reference to cache table row 238-2 in Figure 5. Therefore, it will be seen that there are two different methods by which
client 210 may obtain a copy of the desired object. It may issue a second HTTP request to server SYSB 615 using the
URI, or it may retrieve the corresponding cache element 520 from the cache.
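
As an illustration of how a client might read the OID parameter out of an applet tag, the following Python sketch uses the standard library's html.parser module. It is only a sketch: the class name AppletTagParser, the helper find_applets, and the CODE value BAR.CLASS in the sample markup are assumptions, and the OID attribute is the extension described in this document, not part of standard HTML.

```python
from html.parser import HTMLParser
from typing import Dict, List, Optional


class AppletTagParser(HTMLParser):
    """Collect the attributes of every APPLET start tag in a document."""

    def __init__(self) -> None:
        super().__init__()
        self.applets: List[Dict[str, Optional[str]]] = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag == "applet":
            self.applets.append(dict(attrs))


def find_applets(html_text: str) -> List[Dict[str, Optional[str]]]:
    parser = AppletTagParser()
    parser.feed(html_text)
    parser.close()
    return parser.applets


# Sample markup modeled on the tag described above (CODE value is hypothetical).
SAMPLE = """<HTML><BODY>
<APPLET CODE=BAR.CLASS WIDTH=320 HEIGHT=120 OID=1.1.999999.72.6.3>
</APPLET>
</BODY></HTML>"""

applets = find_applets(SAMPLE)
oid = applets[0].get("oid")  # None when the tag carries no OID parameter
```

Using get means a tag without an OID parameter simply yields None, which corresponds to the case in which the client has only the URI to work with.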

Figure 8 depicts the process by which client computer 210 obtains a copy of the desired object. Using the OID as
specified in document 610, client computer 210 traverses cache table 235, searching for a matching OID value in column
247. As can be seen from Figure 8, a match is found in row 238-2. Client computer 210 then references the corresponding
cache pointer column 249 for row 238-2, which points to cache element 520 within cache 230. Client
computer 210 then copies cache element 520 from cache 230 to its own memory as shown by arrow 840. It will be
noted that the copy loaded, as shown by object 810, is equal to the contents previously downloaded as shown in Figure
5 from server SYSA, and is not freshly loaded from server SYSB 615. That is, the cache copy was obtained without
regard for the location from which it was originally obtained, or for the location specified in HTML document 610.
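
The table traversal described above might look like the following Python sketch. The tuple layout, the function name lookup_by_oid, and the example URIs and byte strings are assumptions chosen for illustration; only the OID value comes from the text.

```python
from typing import List, Optional, Tuple

# Each row is (uri, oid, cached_bytes); oid is None when no OID is recorded.
Row = Tuple[str, Optional[str], bytes]

CACHE_TABLE: List[Row] = [
    ("http://sysa.example/PAGE1.HTML", None, b"page 1 bytes"),
    ("http://sysa.example/FOO.CLASS", "1.1.999999.72.6.3", b"applet bytes"),
    ("http://sysb.example/INTRO.HTM", None, b"intro bytes"),
]


def lookup_by_oid(table: List[Row], oid: str) -> Optional[bytes]:
    """Scan the OID column and return the cached element of the first match.

    The URI column is deliberately ignored: the match depends only on the
    object's identity, not on where the cached copy was downloaded from.
    """
    for _uri, row_oid, element in table:
        if row_oid is not None and row_oid == oid:
            return element
    return None


hit = lookup_by_oid(CACHE_TABLE, "1.1.999999.72.6.3")
miss = lookup_by_oid(CACHE_TABLE, "1.2.3")
```

Here the request originating from the SYSB page is satisfied by the row that was filled from SYSA, which is the location independence the paragraph describes.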

Thus, Figures 2 through 8 depict a location-independent means of obtaining copies of resources, which would otherwise
require redundant transmission of copies of resources identical to copies that had already been
obtained. Although the examples shown depict retrieval of a data object in the form of a Java applet or a GIF file, the
invention may also be applied to retrieval of any machine-readable resource, for example, a portion of a computer program
such as a subroutine or other program segment, a text file, a sound file, or a data object that comprises a collection
of multiple data objects.

Figure 9 is a flow chart depicting the overall operation of the invention. Execution begins in step 910. In step 920,
the client computer determines what resource is to be obtained, e.g., by examining an applet tag in a previously
downloaded HTML file. In step 925, the client computer checks to see whether an OID was specified for the desired
resource. If no OID was specified, execution proceeds with step 930. In step 930, the client computer obtains a copy
using the URI, for example by issuing an HTTP GET request to the server corresponding to the URI.

In step 935, the client checks whether an OID was encoded into the retrieved copy. If so, control proceeds to step 940.
In step 940, the client checks to see whether the OID is valid, for example, by verifying a digital signature. If so, control
proceeds to step 950. In step 950, client computer 210 updates the OID status in the cache for the file obtained. If either
no OID is specified in the retrieved copy, or an OID is specified but is not valid, step 950 is skipped, and control proceeds
to step 990 where processing of the retrieved object is complete. As noted previously, the steps of verifying the
OID from the copy of the retrieved object, and validating the OID against the received object, are optional and may be
omitted in certain environments.

Referring again to step 925, if an OID was specified, control proceeds to step 970. In step 970, the client computer
inspects the cache table to determine whether an object with the desired object identifier has previously been downloaded
and is available in the cache. If no such object is found in the cache, control proceeds to step 930, as if no OID
had been specified. If the desired OID entry is found in the cache table, however, control proceeds to step 975, in which
the client computer retrieves the cache copy, loads it into memory and proceeds to execute using the copy retrieved
from the cache. In any event, after step 975, processing for the object exits at step 990.
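
The flow of Figure 9 can be summarized in a short Python sketch. This is one possible reading of the steps, offered under assumptions: the cache is simplified to a dictionary keyed by OID, and the callables fetch, extract_oid and is_valid (along with the fake helpers, the "OID:" header convention and the example URIs) are hypothetical stand-ins for the network call, the OID decoding and the signature check.

```python
from typing import Callable, Dict, Optional


def obtain_resource(
    uri: str,
    oid: Optional[str],
    cache: Dict[str, bytes],
    fetch: Callable[[str], bytes],
    extract_oid: Callable[[bytes], Optional[str]],
    is_valid: Callable[[str, bytes], bool],
) -> bytes:
    # Steps 925, 970 and 975: an OID was specified and is already cached.
    if oid is not None and oid in cache:
        return cache[oid]
    # Step 930: no OID, or a cache miss; obtain a copy using the URI.
    data = fetch(uri)
    # Step 935: check whether an OID is encoded in the retrieved copy.
    embedded_oid = extract_oid(data)
    # Steps 940 and 950: record the copy under its OID only if the OID is valid.
    if embedded_oid is not None and is_valid(embedded_oid, data):
        cache[embedded_oid] = data
    # Step 990: processing of the object is complete.
    return data


# Hypothetical stand-ins used only to exercise the sketch.
network_calls = []


def fake_fetch(uri: str) -> bytes:
    network_calls.append(uri)
    return b"OID:1.1.999999.72.6.3\napplet bytes"


def fake_extract_oid(data: bytes) -> Optional[str]:
    header = data.split(b"\n", 1)[0]
    return header[4:].decode("ascii") if header.startswith(b"OID:") else None


def always_valid(oid: str, data: bytes) -> bool:
    return True  # models a trusted environment in which validation is omitted


cache: Dict[str, bytes] = {}
OID = "1.1.999999.72.6.3"
first = obtain_resource("http://sysa.example/FOO.CLASS", OID, cache,
                        fake_fetch, fake_extract_oid, always_valid)
second = obtain_resource("http://sysb.example/BAR.CLASS", OID, cache,
                         fake_fetch, fake_extract_oid, always_valid)
```

In this run only the first request reaches the network; the second, although it names a different URI, is served from the cache because its OID matches.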

Figure 10 depicts an alternate embodiment of the present invention. As shown in Figure 10, a plurality of clients
1010 and 1020 are in communication with a firewall server 1040 using communication links 1030. Firewall server 1040
may be, for example, a proxy server, a firewall or a gateway. Cache 1050 is operated under the control of firewall server
1040. In response to requests from clients 1010 and 1020, firewall server 1040 makes requests to various servers in network
1060 over communication link 1070.

In the embodiment depicted it is not necessary for clients 1010 and 1020 to maintain their own caches. Instead,
requests for resources are provided to the firewall server 1040 and include both the location-dependent URI associated
with the specific copy of the resource, as well as the location-independent OID (if any) associated with the resource.
Firewall server 1040 maintains cache 1050 as previously described. When an object with an OID corresponding to an
OID requested by one of clients 1010 or 1020 is found in cache 1050, firewall server 1040 responds to the respective
client with a copy of the object as obtained from the cache. If no such object is found in the cache, firewall server 1040
uses the supplied URI to obtain a new copy from network 1060.
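
A shared cache of this kind could be sketched in Python as follows. The class name CachingProxy, its request method and the example URIs are hypothetical. As a simplification, the sketch files each fetched object under the OID supplied in the request, which assumes a trusted environment in which the validation step is omitted.

```python
from typing import Callable, Dict, Optional


class CachingProxy:
    """Serve requests from several clients out of one OID-indexed cache."""

    def __init__(self, fetch: Callable[[str], bytes]) -> None:
        self._fetch = fetch                  # stands in for requests to the network
        self._by_oid: Dict[str, bytes] = {}  # the shared cache, keyed by OID
        self.network_calls = 0

    def request(self, uri: str, oid: Optional[str] = None) -> bytes:
        # Serve from the shared cache when the requested OID is present.
        if oid is not None and oid in self._by_oid:
            return self._by_oid[oid]
        # Otherwise use the supplied URI to obtain a new copy.
        self.network_calls += 1
        data = self._fetch(uri)
        if oid is not None:
            self._by_oid[oid] = data  # trusted-environment assumption
        return data


proxy = CachingProxy(lambda uri: b"data from " + uri.encode("ascii"))
OID = "1.1.999999.72.6.3"
# Two different clients request the same object through different URIs.
copy_a = proxy.request("http://sysa.example/FOO.CLASS", OID)
copy_b = proxy.request("http://sysb.example/BAR.CLASS", OID)
```

Because the cache lives on the intermediary, the second client benefits from a download triggered by the first, even though neither client keeps a cache of its own.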

While the invention is described in terms of preferred embodiments in a specific system environment, those skilled
in the art will recognize that the invention can be practiced, with modification, in other and different hardware and software
environments within the spirit and scope of the appended claims.

Claims 

1. A method for obtaining a copy of a data object, comprising the steps of:

(a) obtaining a location-independent identifier for the data object from a primary file;

(b) interrogating a cache to determine whether a copy of the data object is cached;

(c) if the data object is cached, obtaining a copy of the data object.

2. The method as recited in claim 1, further comprising the step of:

(d) if the data object is not cached, performing a network call to obtain a copy of the data object.



3. The method as recited in claim 1, in which steps (b) and (c) are performed by a client program.

4. The method as recited in claim 1, in which the location-independent identifier is an ASN.1 object identifier.

5. The method as recited in claim 1, in which the data object comprises logic and data.

6. The method as recited in claim 1, in which the data object is a computer program.

7. The method as recited in claim 1, in which the data object is a class object.

8. The method as recited in claim 1, in which the data object is a program subroutine.



9. The method as recited in claim 1, in which the executable program segment is in a platform-independent object
code form.

10. The method as recited in claim 1, in which the data object is an image file.

11. The method as recited in claim 1, in which the data object is a text file.

12. The method as recited in claim 1, in which the data object is a multimedia file.

13. An apparatus for obtaining a copy of a data object, the apparatus comprising:

(a) a computer and a cache,

(b) the cache being indexed by a cache table,

(c) the computer being responsive to requests for a data object having a location-independent identifier,
whereby the computer interrogates the cache to determine whether the data object is cached, and if the data
object is cached, obtains a copy of the data object from the cache.

14. The apparatus as recited in claim 13, whereby if the data object is not cached, the computer performs a network
call to obtain a copy of the data object.

15. The apparatus as recited in claim 13, in which the location-independent identifier is an ASN.1 object identifier.

16. The apparatus as recited in claim 13, in which the data object comprises an executable program segment.

17. The apparatus as recited in claim 13, in which the data object is a computer program. 

18. The apparatus as recited in claim 13, in which the data object is a class object. 

19. The apparatus as recited in claim 13, in which the data object is a program subroutine.

20. The apparatus as recited in claim 13, in which the executable program segment is in a platform-independent object
code form.

21. The apparatus as recited in claim 13, in which the data object is an image file.

22. The apparatus as recited in claim 13, in which the data object is a text file.

23. The apparatus as recited in claim 13, in which the data object is a sound file.

24. A system for the transmission of a specified data object, the system comprising:

(a) a client computer having a cache, the cache being indexed by a cache table;

(b) at least one server computer;

(c) the client computer and the server computer coupled by a communications link;

(d) the computer being responsive to requests for a specified data object having a location-independent identifier,
whereby the client computer interrogates the cache to determine whether the data object is cached, and
if the data object is cached, obtains a copy of the data object from the cache, and otherwise calls the server
computer to obtain a copy of the data object.

25. The system as recited in claim 24, in which the location-independent identifier is an ASN.1 object identifier.

26. The system as recited in claim 24, in which the data object comprises an executable program segment.

27. The system as recited in claim 24, in which the data object is a computer program.

28. The system as recited in claim 24, in which the data object is a class object.

29. The system as recited in claim 24, in which the data object is a program subroutine.

30. The system as recited in claim 24, in which the executable program segment is in a platform-independent object
code form.

31. The system as recited in claim 24, in which the data object is an image file.

32. The system as recited in claim 24, in which the data object is a text file.

33. The system as recited in claim 24, in which the data object is a sound.


EP 0 834 818 A2 



Fig. 1

[Block diagram with blocks labeled CPU, ROM, RAM, I/O ADAPTER, COMMUNICATIONS ADAPTER and NETWORK; reference numerals 10, 14, 16, 18, 20 and 34.]

Fig. 3

[Listing of HTML document 225 (SYSA/PAGE1.HTML), titled "Welcome to System A", whose body contains applet tag 330 with CODE, WIDTH=300, HEIGHT=100 and OID=1.1.999999.72.6.3 parameters; reference numerals 310, 320, 330, 340, 342, 344 and 350.]




Fig. 5A

[Diagram of a Java class file layout, reference numerals 560 through 576, with fields labeled 0xCAFEBABE, MINOR VERSION, MAJOR VERSION, CONSTANT POOL COUNT, CONSTANT POOL, ACCESS FLAGS, THIS CLASS, SUPERCLASS, INTERFACE COUNT, INTERFACES, FIELDS COUNT, FIELDS, METHODS COUNT, METHODS, ATTRIBUTES COUNT and ATTRIBUTES.]




Fig. 5B

[Diagram with fields labeled "ASN.1 OID", AUTHORIZATION CODE BYTES 1-2 and APPLICATION DATA; reference numerals 580 and 585.]




Fig. 7

[Listing of HTML document SYSB/INTRO.HTM, titled "System B Introduction Page", whose body contains the text "Welcome to SYSTEM B." and applet tag 730 with CODE, WIDTH=320 and HEIGHT=120 parameters; reference numerals 625, 710, 730, 742, 744 and 750.]




Fig. 9

[Flow chart: BEGIN (910); DETERMINE RESOURCE TO BE OBTAINED (920); RETRIEVE COPY FROM CACHE (975); OBTAIN COPY USING URI (930); OID IN RETRIEVED COPY? (935); decision 940; ADD OID POINTER IN CACHE (950); EXIT (990).]




Fig. 10

[Block diagram: client 1010, FIREWALL 1040, cache 1050, network 1060 and communication link 1070.]






(19) Europäisches Patentamt
     European Patent Office
     Office européen des brevets

(11) EP 0 834 818 A3

(12) EUROPEAN PATENT APPLICATION

(88) Date of publication A3: 31.03.1999 Bulletin 1999/13

(43) Date of publication A2: 08.04.1998 Bulletin 1998/15

(21) Application number: 97110837.8

(22) Date of filing: 01.07.1997

(51) Int. Cl.6: G06F 17/30

(84) Designated Contracting States: AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

(30) Priority: 02.07.1996 US 675237

(71) Applicant: SUN MICROSYSTEMS, INC., Mountain View, California 94043-1100 (US)

(72) Inventor: Herriot, Robert G., Palo Alto, California 94301-4003 (US)

(74) Representative: Kindermann, Manfred, Patentanwalt, Sperberweg 29, 71032 Böblingen (DE)

(54) System, method, apparatus and article of manufacture for identity based caching



(57) A process for obtaining a copy of a data object is disclosed. A location-independent identifier associated
with the desired data object is obtained, for example, from a primary file that requires a copy of the data
object. A cache is interrogated to determine whether a copy of the data object is cached. If the data object is
cached, a copy of the cached data object is obtained from the cache. If the data object is not cached, a network
call is performed to obtain a new copy of the data object.





European Patent Office

EUROPEAN SEARCH REPORT

Application Number: EP 97 11 0837



DOCUMENTS CONSIDERED TO BE RELEVANT

Citation of document with indication, where appropriate, of relevant passages:

ANDERSON T E ET AL: "SERVERLESS NETWORK FILE SYSTEMS", OPERATING SYSTEMS REVIEW (SIGOPS), vol. 29, no. 5,
1 December 1995, pages 109-126, XP000584821
* page 113, right-hand column, line 18 - page 116, left-hand column, line 20 *

ANONYMOUS: "Extensible, Language Independent Method of Specification for Arbitrary ASN.1 Values",
IBM TECHNICAL DISCLOSURE BULLETIN, vol. 37, no. 4B, April 1994, pages 353-356, XP002091096, New York, US
* the whole document *

ANONYMOUS: "Name Server Based NAMING Algorithms With the Concept of Assigned Pre-Fixes in Distributed Directory
Environment", IBM TECHNICAL DISCLOSURE BULLETIN, vol. 28, no. 8, January 1986, pages 3583-3588, XP002091097,
New York, US
* the whole document *

Relevant to claim:

1,2,5,6,8-14,16,17,19-24,26,27,29-33
4,15,25
4,15,25
1,13,24

The present search report has been drawn up for all claims

CLASSIFICATION OF THE APPLICATION (Int.Cl.6): G06F17/30

TECHNICAL FIELDS SEARCHED (Int.Cl.6): G06F

Place of search: BERLIN

Date of completion of the search: 26 January 1999

Examiner: Deane, E

CATEGORY OF CITED DOCUMENTS

X : particularly relevant if taken alone
Y : particularly relevant if combined with another document of the same category
A : technological background
O : non-written disclosure
P : intermediate document

T : theory or principle underlying the invention
E : earlier patent document, but published on, or after the filing date
D : document cited in the application
L : document cited for other reasons

& : member of the same patent family, corresponding document






